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Nucleic acids coding for enzyme activities of spinosyn 
biosynthesis 

The present invention relates to nucleic acids coding 
for enzyme activities of spinosyn biosynthesis and to 
the relevant enzymes per se. 

Spinosyns represent a novel group of macrolide 
compounds which have been isolated from the 
actinomycete Saccharopolyspora spinosa (Mertz and Yao, 
1990). They are used for controlling insects 
(WO "97/00265, WO 94/20518, WO 93/09126, US 5670364, 
US 5362634, US 5227295, US 5202242). Spinosyns exhibit 
a strongly insecticidal but no antibacterial activity, 
whereby they can be distinguished from conventional 
macrolides such as tylosin, spiramycin and 
erythromycin, which have no insecticidal but 
antimicrobial activity. 

The spinosyn structure is composed of a tetracyclic 
polyketide skeleton (aglycone) with a 12-membered 
macrolide ring and a 5 , 6 , 5-cis-anti- trans- tricycle and 
also a D-forosamine sugar moiety and a 2,3,4-tri-O- 
methyl-L-rhamnose sugar moiety (Kirst et al . , 1991). 
More than 20 different natural spinosyn derivatives, 
the "A83543" complex, have hitherto been, described 
(WO 97/00265, WO 94/20518, WO 93/09126). These 
derivatives differ in the substitution of one or some 
methyl groups on the tetracyclic skeleton, on the 
forosamine sugar moiety or the trimethylrhamnose sugar 
moiety. A 17-pseudoaglycone which lacks the forosamine 
sugar moiety has likewise been isolated from S. spinosa 
culture broths. 

The main components of the A83 543 complex formed by 
S, spinosa are the variants spinosyn A and spinosyn D 
which represent the essential components of the product 
Spinosad (cf. Pesticide Manual, British Crop Protection 
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Council, 11th Ed., 1997, page 1272 and Dow Elanco trade 
magazine Down to Earth, Vol. 52, . No- 1, 1997, and the 
references cited therein). 

5 Based on studies on incorporating 13 C-labelled acetate, 
propionate, butyrate or isobutyrate it was possible to 
show that the A83543 biosynthesis follows a polyketide 
biosynthesis path (Nakatsukasa et al . , 1990). 
. Polyketides are synthesized from short-chain acid 

10 building blocks such as acetate, propionate or butyrate 
by multifunctional enzymes, the "'polyketide synthases'' 
(PKSs). They catalyse, similar to the related fatty 
acid synthases (FASs) , decarboxylating polycondensation 
steps of the building blocks activated as CoA 

15 thioesters. While FASs after each condensation step 
catalyse, by keto reduction, dehydration and enoyl 
reduction, a complete reduction of the (5-oxoesters 
which are formed intermediately on the growing 
polyketide chain, PKSs can dispense with particular 

20 reduction steps. Modular type I PKSs consist of one or 
more large multifunctional proteins. In contrast, 
iterative type II PKSs are a complex composed of 
substantially monof unctional proteins . 

25 The enzymatic activities of modular type I PKSs can be 
combined to "modules". Here, a module carries an 
arrangement of three enzyme-catalytically active 
domains which lead to an elongation of the growing 
polyketide chain by a biosynthetic elongation unit. The 

30 said domains are a p-ketoacyl : acyl carrier protein 
synthase domain, an acyl transferase domain and a 
p-ketoacyl : acyl carrier protein domain. A module may 
also carry a ketoreductase domain, a dehydratase 
domain, an enoylreductase domain and a thioesterase 

35 domain. A "loading" module at the start of biosynthesis 
is able to carry from the said domains only an 
acyltransf erase domain and a p-ketoacyl : acyl carrier 
protein domain and also an enzymatically inactive 
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(J— ketoacyl :acyl- carrier protein synthase domain. A 
polyketide synthase domain comprises one each of the 
said enzymatic activities. 

5 Owing to the potent insecticidal activity and the 
remarkable structure of the spinosyns, there is great 
interest in decoding the genetic information for their 
biosynthesis . . 

10 The invention relates to nucleic acids which comprise 
. at least one region coding for an enzyme activity which 
is involved in biosynthesis of spinosyns. 

The present invention provides a cluster of open 
15 reading frames (ORFs) whose translation products are 
involved in biosynthesis of spinosyns. It furthermore 
provides additional genes or ORFs which are located 
outside the approx . 120 kb spinosyn biosynthesis 
cluster and whose translation products are involved in 
20 biosynthesis of rhamnose sugars. 

The nucleic acids of the invention are in particular 
single- stranded or double-stranded deoxyribonucleic 
acids (DNA) or ribonucleic acids (RNA) . Embodiments 
25 which are preferred are genomic DNA fragments and 
cDNAs. 

The term "at least one region", as used herein, means 
that the nucleic acid of the invention may comprise one 
30 or more sequences which each code for individual 
activities that carry out steps in the synthesis of 
spinosyns. Accordingly, nucleic acids which code only 
for a single enzyme activity in spinosyn biosynthesis 
are regarded as inventive, too. 



35 



The term "enzyme activity", as used herein, means that 
it is possible to express, starting from the nucleic 
acids studied herein, at least that part of a complete 
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enzyme, which still exerts the catalytical properties 
of the enzyme. 

The nucleic acids of the invention code in particular 
5 for enzyme activities of polyketide synthases, methyl- 
. transferases, epimerases, glycosyltransf erases , amino- 
transferases, dimethyltransf erases, reductases, 
dehydratases and/or cyclization enzymes. 

10 The nucleic acids of the invention are preferably DNA 
fragments corresponding to S. spinosa genomic DNA. 

The nucleic acids of the invention particularly 
preferably comprise at least one sequence selected from 

15 

(a) the sequences according to SEQ ID NOS : 1, 2, 3, 4, 
5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, .29, 
31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 52 or 
54, 

20 

(b) part sequences of at least 14 base pairs in length 
of the sequences defined under (a) , 

(c) sequences which hybridize to the sequences defined 
25 under (a) , 

(d) sequences which are at least 70%, preferably 80%, 
particularly preferably 90%, identical to the 
sequences defined under (a) , 

30 

(e) sequences which are complementary to the sequences 
defined under (a), and 

(f) sequences which, due to the degeneracy of the 
35 genetic code, code for the same amino acid 

sequence as the sequences defined under (a) to 
(d) . 



10 
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The term "hybridize", as used herein, describes the 
process in which a single-stranded nucleic acid 
molecule undergoes base pairing with a complementary 
strand. In this way it is possible, for example 
starting from genomic DNA from organisms, which are 
phylogenetically related to S. spinosa and are capable 
of spinosyn biosynthesis, . to isolate DNA fragments 
which have the same properties as the fragments 
isolated from S. spinosa . 



Preferred hybridization conditions are indicated below: 
hybridization solution: 5 x SSC; blocking reagent 
(Roche Diagnostics GmbH, Mannheim, Germany) , 1%; 
N-lauroylsarcosine, 0.1%; SDS (sodium dodecyl sulphate) 
15 0.02%; hybridization temperature: 60°C; first washing 
step: 2 x SSC at 60°C; second washing step: 2 x SSC at 
60°C; preferred second washing step: 0.5 x SSC at 60°C; 
particularly preferred second washing step: 0.2 x SSC 
at 60°C. 



The degree of identity of the nucleic acids is 
preferably determined with the aid of the GAP program 
from the GCG program package (Devereux et al . , 1984), 
Version 9.1, under standard conditions. 

Particular emphasis is on nucleic acids which comprise 



(1) either all sequences coding for steps in 
forosamine and trimethylrhamnose biosyntheses , in 

30 particular the sequences according to SEQ ID NOS : 

4 and 51, or 

(2) all sequences coding for polyketide synthesis 
steps, in particular the sequences according to 
SEQ ID NOS: 5 and 6, or 

35 (3) all sequences coding for all steps in forosamine, 
trimethylrhamnose and polyketide syntheses, in 
particular the sequences according to SEQ ID NOS: 
1, 2, 3 and 51. 
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All DNA sequences required for spinosyn biosynthesis or 
for the synthesis of precursors as defined below may 
therefore be located on a single vector. These nucleic 
acids, however, may also be present on . two or more 
vectors and be expressed simultaneously or successively 
in a host cell . 

All ORFs of the nucleic acids of the invention may be 
switched on by their own promoters or by heterologous 
promoters. 

The present, invention also relates to the regulatory 
regions, which control transcription of the nucleic 
acids of the invention naturally, i.e. in the original 
organism S. spinosa. 

The term ''regulatory regions", as used herein, relates 
to promoters, repressor or activator binding sites, 
repressor or activator sequences, and terminators. This 
term likewise includes furthermore genetically mobile 
elements which occur naturally, i.e. in the original 
organism S. spinosa. Such genetically mobile elements 
may be transposable or mobilizable elements or 
functional parts thereof, IS elements or other 
insertion elements. This term furthermore also includes 
amplifiable DNA elements (amplifiable units of DNA, 
AUD; Fishman and Hershberger, 1983) which occur 
naturally, i.e. in the original organism S. spinosa. 
The invention also relates to any combination of these 
regulatory regions with one another or with 
heterologous DNA fragments such as, for example, 
promoters, repressor or activator binding sites, 
transposable, mobilizable or transducible elements. 



The present invention furthermore relates 
constructs which comprise at least one nucleic 
the invention and a heterologous promoter. 



to DNA 
acid of 
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The term "heterologous promoter", as used herein, 
relates to a promoter which does not control expression 
of the corresponding gene (ORFs) in the original 
organism. 

The selection of heterologous promoters depends on 
whether prokaryotic or eukaryotic cells or cell-free 
systems are used for expression. A preferred example of 
a heterologous promoter is the promoter of the mel gene 
from the vector pIJ702 (The John Innes Foundation, 
Norwich, UK 1985) . Heterologous expression may be used, 
for example, in order to achieve an increase in 
spinosyn production compared with the natural spinosyn 
producer . 



The invention further relates to vectors which contain 
at least one of the nucleic acids of the invention. 
Vectors which may be used are all phages, plasmids, 
phagemids, phasmids, cosmids, YACs, BACs, PACs 
artificial chromosomes or particles suitable for 
particle bombardment, which are used in molecular- 
biological laboratories. 

Preference is given to BAC vectors. BAC (bacterial 
artificial chromosome) vectors have been developed for 
cloning of large DNA fragments (Shizuya et al . , 1992). 
They are single-copy plasmids with a factor F origin, 
which can harbour DNA fragments of an average size of 
120 kilo base pairs (kb). They can be replicated in 
Escherichia coli. The BAC vector pBeloBACll (Kim et 
al., 1996) carries a T7 and an SP6 promoter which flank 
the cloning site and can be used as starting region for 
sequencing primers and for generating RNA transcripts. 



Particular preference is given to the 
clones which the present invention relates 
have been deposited with the Deutsche 



BAC shuttle 
to and which 
Sammlung von 
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Mikrobrganismen und Zellkulturen GmbH (DSMZ) , 
Mascheroder Weg lb, D-38124 Brunswick under deposition 
numbers DSM 13010, DSM 13011 and DSM 13012 in 
accordance with the requirements of the Budapest Treaty 
5 on 18 August 1999. 

The deposited BAC shuttle clones P11/G6, P8/G11 and 
P11/B10 each carry an S. spinosa DNA fragment of at 
least 100 kb in size. The clones P11/G6 and P11/B10 

10 each carry a part of the nucleic acid sequence 
according to SEQ ID NO: 4 and also the neighbouring 
complete nucleic acid sequences according to SEQ ID 
NOS : 5 and 6 and also a DNA region 3' adjacent to the 
nucleotide sequence according to SEQ ID NO: 6 (Fig. 7). 

15 The clone P8/G11 carries a part of the nucleic acid 
sequence according to SEQ ID NO: 6, the complete 
nucleic acid sequences according to SEQ ID NOS: 5 and 4 
and also a DNA region 3' adjacent to the sequence 
according to SEQ ID NO: 4 (Fig. 7) . 

20 

Vectors which are in the same way suitable for spinosyn 
production are also PAC vectors and all other 
functionally equivalent vectors which allow 

transferring large DNA fragments, in particular those 

25 DNA fragments larger than 30 kb, preferably larger than 
40 kb, particularly preferably larger than 60 kb, into 
heterologous host cells and ensuring establishing of 
foreign DNA there. Preference is given to using those 
BAC, PAC and functionally equivalent vectors which have 

30 been modified into a shuttle vector and allow, for 
example, plasmid replication both in Gram-negative 
bacteria such as Escherichia coli and in Gram-positive 
bacteria such as Streptomyces . Such preferred shuttle 
vectors may carry DNA fragments of a size which cannot 

35 be cloned into conventional vectors such as, for 
example, cosmid vectors and cannot be transferred into 
heterologous hosts such as act inomycetes , for example 
s treptomycetes . The latter vectors may be transferred 
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both, by transformation/ conjugation, electroporation, 
protoplast transformation or by other suitable methods . 
It is in an excellent way possible to transfer those 
shuttle vectors within a heterologous population of 
5 Gram-negative or Gram-positive bacteria, between Gram- 
positive and Gram-negative bacteria, between bacteria 
and Archea and between prokaryotes and eukaryotes by 
conjugation. The BAC, PAC or functionally equivalent 
shuttle vectors, transferred into heterologous hosts 

10 such as, for example, streptomycetes, may be replicated 
autonomously or integrated into the host genome. The 
latter integration may be carried out via homologous 
recombination, via a d>C31 integration mechanism 
(Hopwood et al., 1985), via location-specific 

15 integration which depends on pSAM2 (Smokvina et al . , 
1990; WO 95/1604 6) -determined functions or via mini- 
circle-mediated functions (Motamedi et al., 1995; WO 
96/00282) . 

20 Such shuttle vectors allow expression of specific 
biosynthetic pathways of primary or secondary 
metabolites, which are determined by extraordinarily 
large DNA regions, by transferring a single recombinant 
vector heterologously into particularly suitable host 

25 cells. Thus it is possible to express the identified 
cluster for spinosyn biosynthesis in organisms such as 
actinomycetes, for example Streptomyces , by 

transferring a single recombinant shuttle vector. Owing 
to the size of this biosynthesis cluster, the said 

30 heterologous expression of spinosyn biosynthesis is 
impossible by using a single cosmid vector. 
Transferring a recombinant BAC, PAC or functionally 
equivalent shuttle vector, which carries the nucleic 
acids of the invention, can result in a significant 

35 increase in spinosyn production compared with spinosyn 
production in the strain S. spinosa or in derived 
mutants with increased spinosyn formation. In addition 
it is possible to use such a shuttle vector coding for 
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spinosyn biosynthesis for the purpose of utilizing, 
after transfer into heterologous host cells, their 
biosynthetic and modifying capabilities, in order to 
achieve a significant modification of spinosyn or 
5 biosynthetic precursors of spinosyn- This makes it in 
addition possible to produce novel spinosyn derivatives 
by transferring a single recombinant vector into 
heterologous host cells. 

10 It is furthermore possible to use such shuttle vectors 
for genetically modifying cloned biosynthesis pathways 
of secondary metabolites as part of a single 
recombinant shuttle vector. Such modifications may be 
carried out, for example, in an E . coli host, for 

15 example by utilizing recombination events with 
participation of the recA gene product or the recE and 
recT gene products (Muyrers et al . , 1999). Furthermore 
it is possible to modify such vectors by in vitro 
methods such as, for example, the template generation 

20 system (Finnzymes, FIN-02201, Espoo, Finland) or the 
Transposomics system (Epicentre Technologies, Biozym 
Diagnostika GmbH, Oldendorf , Germany) . Such shuttle 
vectors coding for modified biosynthetic pathways may 
then be transferred into suitable host cells in order 

25 to produce modified secondary metabolites. Analogously, 
the said shuttle vectors may be used for modifying the 
nucleic acids of the invention in order to use them 
subsequently, after transfer into suitable host cells, 
for production of modified spinosyns. 

30 

Parts of the nucleic acids of the invention may also 
determine genetic information as components of two or 
more vectors such as, for example, cosmid vectors which 
combined with one another are suitable for the 
35 biosynthesis of spinosyn or spinosyn precursors such 
as, for example, pseudoaglycone or spinosyn aglycone. 
Such combinations of recombinant vectors may be used in 
order to achieve spinosyn production in organisms other 
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than. S. spinosa. With expression in particularly 
suitable hosts, this may result in a significant 
increase in spinosyn production compared with 
S. spinosa or derived, production-enhanced mutants. It 
5 is furthermore possible to modify the nucleic acids of 
the invention in individual recombinant vectors of the 
said vector combination such that heterologous 
production of spinosyn derivatives in host cells is 
possible. Moreover, such a combination of recombinant 
10 vectors may be suited, due to their transfer into 
heterologous hosts, to forming novel spinosyn 
derivatives by utilizing the host's endogenous enzyme 
system. 

15 The present invention also relates to host cells 
containing at least one of the nucleic acids of the 
invention. Suitable host cells are both prokaryotic 
cells, preferably actinomycetes , particularly 

preferably streptomycetes , and eukaryotic cells such as 

20 mammalian cells, plant cells. or yeast cells. 

In a particular way the nucleic acids of the invention 
can be transferred into plant cells and be expressed 
there. This makes it possible to prepare transgenic 
25 plants which produce the plant-protecting, insecticidal 
spinosyn or derivatives thereof. The nucleic acids of 
the invention may be transferred into the plant cells 
or plant cell cultures by conventional methods, inter 
alia also by particle bombardment. 

30 

The present invention further relates to the 
polypeptides encoded by the nucleic acids of the 
invention. The polypeptides of the invention may 
constitute a complete enzyme which catalyses a spinosyn 
35 biosynthesis step. However, the invention also includes 
those polypeptides which have only a part of the 
complete amino acid sequence of the relevant enzyme. 
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The term "part sequence", as used herein, thus relates 
to the amino acid sequence of a polypeptide which can 
still exert the activity of the corresponding complete 
enzyme or an enzymatically active domain. 

5 

In the following, . preferred nucleic acids and 
polypeptides of the invention are characterized in more 
detail, with reference to the appropriate SEQ ID NOS. 

10 SEQ ID NOS: 7 and 8, ORF1 : . 

Nucleotide position 828 to 1 of SEQ ID NO: 4, 275 amino 
acids; the derivable gene product is a methyl- 
transferase. 

15 SEQ ID NOS: 9 and 10, ORF2: 

Nucleotide position 1283 to 2455 of SEQ ID NO: 4, 390 
amino acids; the derivable gene product is a glycosyl- 
transf erase. 

20 SEQ- ID NOS: 11 and 12, ORF3: 

Nucleotide position 2495 to 3247 of SEQ ID NO: 4, 250 
amino acids; the derivable gene product is a methyl- 
transferase . 

2 5 SEQ ID NOS: 13 and 14, ORF4 : 

Nucleotide position 4440 to 3253 of SEQ ID NO: 4, 395 
amino acids; the derivable gene product is a methyl- 
transferase . 

3 0 SEQ ID NOS: 15 and 16, ORF5 : 

Nucleotide position 4578 to 6197 of SEQ ID NO: 4, 539 
amino acids; the derivable gene product is a C-C 
linking enzyme which carries out cyclization reactions. 

3 5 SEQ ID NOS: 17 and 18, ORF6: 

Nucleotide position 6211 to 7404 of SEQ ID NO: 4, 397 
amino acids; the derivable gene product is a methyl- 
transf erase . 
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SEQ ID NOS: 19 and 20, ORF7 : 

Nucleotide position 7401 to 8300 of SEQ ID NO: 4, 299 
amino acids; the derivable gene product is a methyl- 
5 transferase. 

SEQ ID- NOS: 21 and 22, ORF8 : 

Nucleotide position 8300 to 9466 of SEQ ID NO: 
amino acids; the derivable gene product is an 
10 which is involved in cyclization reactions. 

SEQ ID NOS: 23 and 24, ORF9: 

Nucleotide position 10572 to 9562 of SEQ ID NO: 
amino acids; the derivable . gene product 
15 2 , 3-reductase . 

SEQ ID NOS: 25 and 26, ORF10: 

Nucleotide position 12029 to 10569 of SEQ ID NO: 
amino acids; the derivable gene product 
20 2 , 3-dehydratase . 

SEQ ID NOS: 27 and 28, ORF11; 
Nucleotide position 12549 to 12109 of SEQ ID NO: 4, 146 
amino acids; the derivable gene product has homologies 
25 with a thioesterase . 

SEQ ID NOS: 29 and 30, ORF12 : 

Nucleotide position 13865 to 12546 of SEQ ID NO: 
amino acids; the derivable gene product 
30 glycosyltransf erase . 

SEQ ID NOS: 31 and 32, ORF13: 
Nucleotide position 14245 to 15633 of SEQ ID NO: 4, 462 
amino acids; the derivable gene product is a 
35 3, 4-dehydratase . 



4, 388 
enzyme 



4, 336 
is a 



4, 486 

is a 



4 , 439 
is a 



SEQ ID NOS: 33 and 34, ORF14 : 
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Nucleotide position 15671 to 16828 of SEQ ID NO: 4, 385 
amino acids; the derivable gene product is a 

4 - aminotransferase . 

5 SEQ ID NQS: 35. and 36, ORF15: 

Nucleotide position 16831 to 17580 of SEQ ID NO: 4, 249 
amino acids; the . derivable gene product is an 
N-dimethyltransf erase . 

10 SEQ ID NQS: 37 and 38 , ORF16: 

Nucleotide position 18930 to 18205 of SEQ ID NO: 4/ 241 
amino acids; the derivable gene product is a 
3, 4-reductase. 

15 SEQ ID NQS: 39 and 40, ORF17 : 

Nucleotide position 19025 to 19861 of SEQ ID NO: 4, 278 
amino acids; the derivable gene product is a 
transcription regulator. 

2 0 SEQ ID NQS : 41 and 42, . ORF18 : 

Nucleotide positions 116-7903 of SEQ ID NO: 5, amino 
acid positions 1 to 2595: 

Nucleotide positions 128-1402, amino acid positions 

5- 429, encode a p-ketoacyl : acyl carrier protein 
25 synthase domain; 

Nucleotide positions 1691-2656, amino acid positions 
526-847, encode an acyltransf erase domain; 

Nucleotide positions 2798-3052, amino acid positions 
895-979, encode a p-ketoacyl : acyl carrier protein 

3 0 domain; 

Nucleotide positions 3107-4372, amino acid positions 
998-1419, encode a p-ketoacyl : acyl carrier protein 
synthase domain; 

Nucleotide positions 4688-5662, amino acid positions 
35 1525-1849, encode an acyl trans f erase domain; 

Nucleotide positions 6587-7138, amino acid positions 
2158-2341, encode a ketoreductase domain; 
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Nucleotide positions 7409-7666, amino acid positions 
2432-2517, encode a p-ketoacyl : acyl carrier protein 
domain . 

5 SEQ ID NOS: 43 and 44, 0RF19: 

Nucleotide positions 7921-14379 of SEQ ID NO: 5, amino 
acid positions 1 to 2152: 

Nucleotide positions 8029-9318, amino acid positions 
37-4 66, encode a p-ketoacyl : acyl carrier protein 
10 synthase domain; 

Nucleotide positions 9634-10608, amino acid positions 
572-896, encode an acyltransf erase domain; 

Nucleotide positions 10705-11259, amino acid positions 
929-1113, encode a dehydratase domain; 
15 Nucleotide positions 12043-13080, amino acid positions 
1375-1720,. encode an enoylreductase domain; 
Nucleotide positions 13093-13635, amino acid positions 
1725-1905, encode a ketoreductase domain; 

Nucleotide positions 13885-14142, amino acid positions 
20 1989-2074, encode a p-ketoacyl : acyl carrier protein 
domain; 

SEQ ID NOS: 45 and 46, ORF20: 

Nucleotide positions 14424-23936 of SEQ ID NO: 5, amino 
2 5 acid positions 1 to 3170: 

Nucleotide positions 14523-15824, amino acid positions 
34-467, encode a p-ketoacyl : acyl carrier protein 
synthase domain; 

Nucleotide positions 16110-17075, amino acid positions 
30 563-884, encode an acyltransf erase domain; 

Nucleotide positions 17997-18536, amino acid positions 
1192-1371, encode a ketoreductase domain; 

Nucleotide positions 18795-19052, amino acid positions 
1458-1543, encode a p-ketoacyl : acyl carrier protein 
35 domain; 

Nucleotide positions 19107-20387, amino acid positions 
1562-1988, encode a p- ket oacyl : acyl carrier protein 
synthase domain; 
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Nucleotide positions 20718-21692, amino acid positions 
2099-2423, encode an acyltransf erase domain; 
Nucleotide positions 22620-23171, amino acid positions 
2733-2916, encode a ketoreductase domain; 
5 Nucleotide positions 234 3 6-23 693 , amino acid positions 
3005-3090, encode, a p-ketoacyl : acyl carrier protein 
domain. 

SEQ ID NOS: 4 7, and 48, QRF21: 
10 Nucleotide positions 23983-38757 of SEQ ID NO: 5, amino 
acid positions 1 to 4924: 

Nucleotide positions 24082-25392, amino acid positions 
34-470, encode a p-ketoacyl : acyl carrier protein 
synthase domain; 
15 Nucleotide positions 25696-26661, amino acid positions 
572-893, encode an acyltransf erase domain; 

Nucleotide positions 26761-27315, amino acid positions 
927-1111, encode a dehydratase domain; 

Nucleotide positions 28231-28782, amino acid positions 
20 1417-1600, encode a ketoreductase domain; 

Nucleotide positions 29035-29265, amino acid positions 
1685-1761, encode a p-ketoacyl : acyl carrier protein 
domain; 

Nucleotide positions 29329-30624, amino acid positions 
25 1783-2214, encode a p-ketoacyl : acyl carrier protein 
synthase domain; 

Nucleotide positions 30928-31902, amino acid positions 
2316-2640, encode an acyl transferase domain; 
Nucleotide positions 32827-33378, amino acid positions 
30 2949-3132, encode a ketoreductase domain; 

Nucleotide positions 33652-33900, amino acid positions 
3224-3306, encode a p-ketoacyl : acyl carrier protein 
domain ; 

Nucleotide positions 33952-35262, amino acid positions 
35 3324-3760, encode a p-ketoacyl : acyl carrier protein 
synthase domain; 

Nucleotide positions 35554-36522, amino acid positions 
3858-4180, encode an acyl transferase domain; 
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Nucleotide positions 37453-37998, amino acid positions 
44 91-4 672, encode a ketoreductase domain; 

Nucleotide positions 38254-38511, amino acid positions 
4578-4843, encode a (5-ketoacyl : acyl carrier protein 
5 domain; 

SEQ ID NOS: 49 and 50, ORF22 : 

Nucleotide positions 38808-50000 of SEQ ID NO: 5 and 
the nucleotide positions 1 to 5574 of SEQ ID NO: 6, 
10 amino acid positions 1 to 5588: 

Nucleotide positions 38907-40226 of SEQ ID NO: 5, amino 
acid positions 34-473, encode a p-ketoacyl : acyl carrier 
protein synthase domain; 

Nucleotide positions 40494-41453 of SEQ ID NO: 5, amino 
15 acid positions 563-882, encode an acyltransf erase 
domain; 

Nucleotide positions 41556-42119 of SEQ ID NO: 5, amino 
acid positions 917-1104, encode a dehydratase domain; 
Nucleotide positions 43017-43568 of SEQ ID NO: 5, amino 
20 acid positions 1404-1587, encode a ketoreductase 
domain; 

Nucleotide positions 43833-44090 of SEQ ID NO: 5, amino 
acid positions 1676-1761, encode a p-ketoacyl : acyl 
carrier protein domain; 
25 Nucleotide positions 44151-45473 of SEQ ID NO: 5, amino 
acid positions 1782-2222, encode a p-ketoacyl : acyl 
carrier protein synthase domain; 

Nucleotide positions 45765-46730 of SEQ ID NO: 5, amino 
acid positions 2320-2641, encode an acyl transferase 
3 0 domain; 

Nucleotide positions 46827-47459 of SEQ ID NO: 5, amino 
acid positions 2674-2884, encode a dehydratase domain; 
Nucleotide positions 48 378-48 935 of SEQ ID NO: 5, 
amino acid positions 3191-3376, encode a ketoreductase 
35 domain; 

Nucleotide positions 49182-49412 of SEQ ID NO: 5, amino 
acid positions 3459-3535, encode a P~ ketoacyl : acyl 
carrier protein domain; 
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Nucleotide positions 49482-50000 of SEQ ID NO: 5 and 
nucleotide positions 1 to 759 of SEQ ID NO: 6, amino 
acid positions 3559-3984, encode a p-ketoacyl : acyl 
carrier protein synthase domain; 
5 Nucleotide positions 1084-2049 of SEQ ID NO: 6, amino 
acid positions 4093-4414, encode an acyltransf erase 
domain; 

Nucleotide positions 2146-2697 of SEQ ID NO: 6, amino 
acid positions 4447-4 630, encode a dehydratase domain; 
10 Nucleotide positions 3604-4155 of SEQ ID NO: 6, amino 
acid positions 4933-5116, encode a ketoreductase 
domain; 

Nucleotide positions 4420-4677 of SEQ ID NO: 6, amino 
acid positions 5205-5290, encode a p-ketoacyl : acyl 
15 carrier protein domain; 

Nucleotide positions 4864-5538 of SEQ ID NO: 6/ amino 
acid positions 5353-5577, encode a thioesterase domain. 

SEQ ID NOS: 52 and 53, ORF23: 
20 Nucleotide position 344 to 1333 of SEQ ID NO: 51, 329 
amino acids; the derivable gene product is a dNDP- 
glucose-4 , 6-dehydratase . 

SEQ ID NOS: 54 and 55, ORF2 4 : , 
25 Nucleotide position 1330 to 2247 of SEQ ID NO: 51, 305 
amino acids; the derivable gene product is a dNDP-4- 
keto-6-deoxyglucose-3 , 5-epimerase . 

The products of ORF 5 (SEQ ID NO: 16) and ORF 8 (SEQ ID 
30 NO: 22) involved in cyclization of the 5 , 6 , 5- tricycle 
are of particular interest, owing to the unusual 
cyclization reactions. The present invention therefore 
includes in particular also homologous nucleic acids or 
homologous gene products. Advantageously, these 
35 homologous gene products show at least 50%, preferably 
60% and particularly preferably 70% identity at the 
amino acid level . 
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The invention furthermore relates to antibodies binding 
specifically to the abovementioned polypeptides. Such 
antibodies are produced in the conventional manner. 
These antibodies may be used for identifying expression 
clones of, for example, a gene library, which harbour 
the nucleic acids of the invention. 

The present invention also relates to a method for 
preparing the nucleic acids of the invention. The 
nucleic acids of the invention may be prepared in the 
conventional manner. It is possible, for example, to 
chemically synthesize the entire nucleic acid 
molecules. It is also possible to chemically synthesize 
short sections of the nucleic acids of the invention 
and to label such oligonucleotides radioactively or 
with a fluorescent dye. The labelled oligonucleotides 
may also be used for screening gene libraries of 
organisms. Clones which hybridize to the labelled 
oligonucleotides are selected for isolating the 
relevant DNA. After characterizing the .isolated DNA, 
the nucleic acids of the invention are obtained in a 
simple manner. The nucleic acids of the invention may 
also be prepared by means of PCR methods using 
chemically synthesized oligonucleotides . 

The present invention furthermore relates to methods 
for preparing the polypeptides of the invention. The 
polypeptides encoded by the nucleic acids of the 
invention may be prepared by culturing host cells which 
contain at least one of the nucleic acids of the 
invention under suitable conditions. The desired 
polypeptides may subsequently be isolated from the 
cells or the culture medium in a conventional manner. 
The polypeptides may also be prepared in in vitro 
systems . 



The isolated and characterized gene 
neighbouring or associated DNA regions 



cluster and 
represent a 
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target for increasing spinosyn biosynthesis by genetic 
manipulation, over or underexpression of genes or 
regulatory sequences/ which are directly or indirectly 
involved in the biosynthesis. These manipulations may 
5 be carried out both in natural spinosyn-producing 
organisms and in genetically engineered spinosyn- 
producing organisms. Thus it is possible, for example, 
to put selected ORFs under the control of conventional 
strong promoters such as the mel promoter of plasmid 
10 pIJ702 (John Innes Foundation, Norwich, UK, 1985). 

By cloning and identifying the genes of spinosyn 
biosynthesis, the present invention provides the 
genetic base for preparing novel spinosyn precursors 
15 and spinosyn derivatives by means of molecular-genetic 
methods . 

The term "spinosyn precursors", as used herein, relates 
to all biosynthetic spinosyn precursors which can be 
20 detected or postulated. 

The term "spinosyn derivatives", as used herein, 
relates to structural derivatives of all hitherto known 
spinosyns . 

25 

The invention thus also relates to a method for 
preparing spinosyn precursors and spinosyn derivatives. 

It is possible, for example, to use the nucleic acids 
30 of the invention for preparing novel spinosyn 
derivatives with modifications of the spinosyn aglycone 
by combinatorial biosynthesis. This may be achieved, 
for example, by exchanging the acyltransf erase domain 
which is encoded by ORF 19 and incorporates an acetate 
35 unit for an acyltransf erase domain which incorporates a 
propionate unit. Likewise, the ORF 18 acyltransf erase 
domain incorporating an acetate unit can be exchanged 
for an acyltransf erase domain incorporating a 



. j 
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propionate unit. It is further possible for both or ; 
each of the ketoreductase domains encoded by the two . j 

ORFs mentioned to be inactivated, be replaced by an 
inactive ketoreductase domain or be deleted, and as a 
5 result a . hydroxy! group can be produced 
biosynthetically at the appropriate position in the 
macrocycle. All acyltransf erase , ketoreductase, 
dehydratase, enoylreductase, p-ketoacyl : acyl carrier 
protein and thioesterase domains may be replaced 

10 individually or in any combination by appropriate 
polyketide synthase domains with different substrate 
specificity or reaction specificity, may be fused to 
one another in any. combination, mutated individually or 
in any combination, deleted or duplicated. It is 

15 further possible to exchange module-encoding sequences. 
Thus it is conceivable to replace the module 2-encoding 
DNA sequence (Fig. 6) for the module 1- or module 3, 4, 
5, 6, .7, 8- or module 9-encoding DNA sequence (Fig. 6) 
and to express them functionally. It is also 

20 conceivable to exchange the module 2-encoding DNA 
sequence or any other module-encoding DNA sequence of 
the spinosyn polyketide synthase gene cluster for a 
different module-encoding DNA sequence of the spinosyn 
polyketide synthase gene cluster which incorporates a 

25 different biosynthetic elongation unit. Moreover, any 
other module-encoding DNA sequence of the spinosyn 
polyketide synthase gene cluster may be exchanged for a 
different module-encoding DNA sequence of a different 
polyketide synthase nucleic acid sequence from 

30 S. spinosa or any organism other than S. spinosa such 
^as, for example, Saccharopolyspora erythraea. These 
modifications may be carried out by making use of ET 
recombination (WO 99/29837; Muyrers et al., 1999) or 
other cloning and recombination techniques. 

35 

The invention thus also relates to all module- or 
domain-encoding nucleic acids which are a natural or 
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genetically engineered component of the spinosyn 
polyketide synthase. 

The term "module", as used herein, means an arrangement 
of three enzyme-catalytically active domains which lead 
to elongation of the growing polyketide chain by a 
biosynthetic elongation unit. These domains are a 
P-ketoacyl : acyl carrier protein synthase domain, an 
acyltransf erase domain and a p-ketoacyl : acyl carrier 
protein domain. A module may also carry a ketoreductase 
domain, a dehydratase domain, an enoylreductase domain 
and a thioesterase domain. A "loading" module, at the 
start of biosynthesis is able to carry from the 
mentioned domains only an acyltransf erase and a 
P-ketoacyl : acyl carrier protein domain and also an 
enzymatically inactive p-ketoacyl : acyl carrier protein 
synthase domain. A polyketide synthase domain comprises 
any one of these mentioned enzymatic activities. 

Furthermore, the nucleic acids of the invention may be 
used for preparing libraries of recombinant polyketide 
synthase nucleic acid sequences, recombinant polyketide 
synthase proteins or recombinantly generated 
polyketides in the course of a combinatorial 
biosynthesis by rearrangement and expression of 
spinosyn polyketide synthase nucleic acid sequences or 
by combination and expression together with polyketide 
synthase nucleic acid sequences of a different, 
polyketide synthase-encoding nucleic acid sequence from 
S. spinosa or another organism such as, for example, 
Saccharopolyspora erythaea. These polyketides may be 
glycosylated by using the nucleic acids of the 
invention or by using other nucleic acids whose 
derivable products are involved in the biosynthesis of 
other sugars and coupling to the aglycone. It is known 
that aglycone glycosylat ion plays a decisive part in 
the biological activity at the site of action. The said 
modifications may be carried out both in natural and in 
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genetically engineered spinosyn-producing organisms, in 
particular bacteria. It is furthermore possible to 
carry out the said modifications by making use of ET 
recombination (WO 99/29837; Muyrers et al . , 1999) or 
other cloning and recombination techniques. 

The nucleic acids of the invention, vectors and 
regulatory or genetically mobile regions may also be 
used for finding genes coding for polypeptides, which 
10 encode functionally similar polyketide synthases or 
functionally similar products involved in biosynthesis 
of a sugar. 

Since the nucleic acids of the invention make up an 
15 extensive part of the S. spinosa genome, the nucleic 
acids of the invention may be employed as markers for 
the sequencing of the 5. spinosa genome; this makes the 
arrangement of part sequences of a genome sequencing 
project considerably easier. 

20 

Thus the nucleic acids of the invention provide data 
which can be employed within the framework of a genome 
sequencing project and metabolic engineering, based 
thereupon, for increasing spinosyn production. 

25 

Description of the figures: 

Figure 1 : Model for the biosynthesis of spinosyn sugars 
D-f or os amine and 2,3, 4-tri-O-methyl-L-rhamnose . 

30 

Figure 2: Location of DNA region 1 (SEQ ID NO: 4) and 
DNA region 2 (SEQ ID NOS : 5 and 6) which are directly 
or indirectly involved in spinosyn biosynthesis. The 
black bars in the bottom part of the figure indicate 
35 schematically the positions of the cosmid DNA inserts 
to one another and in relation to DNA regions 1 and 2. 
The cosmid inserts shown were used for sequencing SEQ 
ID NOS: 1 to 3. 



5 
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Figure 3: Schematic representation of the location of 
the inserted DNA (black bars in the bottom part of the 
figure) of the said cosmids which have been used for 
5 attaching a forosamine residue or a trimethylrhamnose 
residue by biotransformation of the spinosyn aglycone 
and spinosyn pseudoaglycone. 

Figure 4 : Schematic representation of the open reading 
10 frames (ORFs) of DNA region 3 corresponding to SEQ ID 
NO: 51 on cosmid 16-2-2. 

Figure 5: Schematic representation of open reading 
frames (ORFs ) of DNA regions 1 and 2. The ORFs are 
15 numbered from 1 to 22, corresponding to SEQ ID NOS : 7, 
9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 
37, 39, 41, 43, 45, 47 and 49. 

Figure 6: Schematic representation of open reading 
2 0 frames (ORFs) of DNA region 2 (SEQ ID NOS: 5 and 6) and 
derivable modules and domains. SM, starting module; Ml 
to M10, module 1 to module 10; KS, (3-ketoacyl : acyl 
carrier protein synthase; AT, acyltransf erase ; ACP, 
(3-ketoacyl : acyl carrier protein; KR, ketoreductase ; DH, 
25 dehydratase; ER, enoylreductase ; TE, thioester ase . 

Figure 7 : Schematic representation of the location of 
DNA inserted in the BAC shuttle clone as black bars in 
the bottom part of the figure. The size of the inserted 
30 DNA is at least 100 kb. Unbroken bars: DNA sequence is 
identical with parts of DNA region 1 and the entire DNA 
region 2 (P11/G6- and P11/B10) or with the entire DNA 
region 1 and parts of DNA region 2 (P8/G11) . Broken 
bars: DNA sequence is outside the sequenced region. 

35 

Examples 



Bacterial strains and plasmids 
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Escherichia coli XLl-Blue MRF' and the cosmid vectors 
SuperCosl (Stratagene, Europe) and pOJ446 (Biermann et 
al . , 1992) were used for establishing gene libraries of 
S. spinosa ATCC49460 (American Type Culture Collection, 
5 U.S.A., EP-A 0 375 316). E. coli JM110 (Stratagene, 
Europe) was used for propagating plasmids which were 
transferred to Streptomyces by transformation. 
Streptomyces albus J1074 (Chater and Wilde, 1980; John 
Innes Institute in Norwich, UK) was used for 
10 heterologous expression of and biotransformation with 
spinosyn biosynthesis genes. 

The plasmids pBeloBACll (Kim et al., 1996) and pOJ446 
(Biermann et al., 1992) were used for preparing an 
15 E . coli - Streptomyces BAC shuttle vector. 

Molecular biological methods 

Molecular biological methods such as DNA restriction, 
20 agarose gel electrophoresis of DNA, ligation of 
restriction fragments, culturing and transformation of 
E . coli, were carried out as described in Sambrook et 
al., (1989). Plasmids were isolated using the Qiagen 
Plasmid Kit (Qiagen, Hilden, Germany) . The enzymes used 
25 were from Roche Diagnostics GmbH (Mannheim, Germany) . 

Cultivation conditions and molecular-genetic methods 
for S. spinosa and streptomycetes are described in 
Hopwood et al., 1985. All S. spinosa or streptomycete 
30 cultivations in liquid culture were carried out 
aerobically in Erlenmeyer flasks at 28°C. 

DNA- DNA hybridizations were carried out using the DIG- 
High-Prime DNA labelling and detection kit according to 
35 the manufacturer's information (Roche Diagnostics GmbH, 
Mannheim, Germany) . 



Growth media: 



10 



35 
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LB Sambrook et al . , 1989 

TS Difco order number 0 370-17-3 (Difco. Detroit, MI, 
USA) 

R5A Illing et al . , 1985 

Preparation of a cosmid S, spinosa gene library 



In order to obtain an S. spinosa gene library, 
chromosomal S. spinosa ATCC49460 DNA was partially cut 
with Mbol and fractionated by centrif ugation in a 
glucose density gradient. Cosmid DNA (SuperCosl, 

15 Stratagene Europe) was prepared according to the 
manufacturer's information, ligated with S. spinosa DNA 
fragments of between 35 and 45 kb and packaged into 
phage particles with the aid of the Gigapack packaging 
system (Stratagene Europe) . Transfection was carried 

20 out in E. coli XL-1 blue MRF' . Likewise, this method 
was used for the purpose of constructing a second 
S. spinosa gene library using the E. coli-Streptomyces 
shuttle cosmid* pOJ4 4 6 . 

25 Sequencing of the spinosyn biosynthesis qene cluster 



and a DNA fragment 


outside 


this 


cluster, whose 


products, however are 


involved 


in the 


biosynthesis of 



spinosyn 

30 The inserted DNA of SuperCosl cosmids 16-1-8, 16-59-1 
and 16-59-8 was sequenced. An approx. 4 kb gap between 
cosmids 16-59-1 and 16-1-8 was closed by sequencing a 
corresponding part region of cosmid 16-59-6 using 
primer walking. 



An approx. 2.3 kb DNA sequence on the SuperCosl cosmid 
16-2-2 was sequenced. 
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Identification and characterization of chromosomal DNA 



fragments 


from an S. spinosa BAC 


shuttle vector 


gene 


library, 


which carry spinosyn 


biosynthesis 


gene 



sequences 

The BAC shuttle vector which can be transferred into 
and propagated not only in E. coll but also in 
actinomycetes such as Streptomyces was prepared by 
linearizing vector pBeloBACll with Xhol and preparing 

10 blunt DNA ends by applying Klenow polymerase. An 
approx. 6 kb Dral - EcoRV DNA fragment of cosmid vector 
pOJ446, which carries the replication origin of plasmid 
SCP2*, the apramycin resistance gene and also orlT for 
conjugative transfer, was ligated with the linearized 

15 BAC vector. The resulting vector was denoted pEBZ333. 

A BAC gene library was established by starting from 
partially Mbol-cut genomic DNA of strain S. spinosa 
ATCC49460 and the Ba/nHI-cut vector pEBZ333. 

20 

Analysis and annotation of open reading frames of DNA 
sequences directly or indirectly involved in spinosyn 
biosynthesi s 

25 Starting from the sequence according to SEQ ID NOS : 1 
to 3, open reading frames (ORFs) which are directly or 
indirectly involved in the biosynthesis of spinosyn 
were identified. These ORFs were divided into two DNA 
regions which are denoted DNA region 1 and DNA region 2 

30 (Figs. 2 and 5) and which carry sequences according to 
SEQ ID NO: 4 or 5 and 6. DNA region 1 carries open 
reading frames whose products are involved in 
modification and tricycle formation of the spinosyn 
aglycone, whereas DNA region 2 (Figs. 2, 5 and 6) 

35 comprises open reading frames whose products encode 
spinosyn polyketide synthase. The two in each case 
first nucleotides of these DNA regions are located 
immediately next to one another (Figs. 2, 3 and 5) . 
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. Another DNA region 3 (SEQ ID NO: 51) is located outside 
this cluster of DNA sequences and carries open reading 
frames whose products are likewise involved in the 
5 biosynthesis of the spinosyn sugar trimethylrhamnose . 

Preparation of the spinosyn aglycone and 17-pseudo- 
aglycone from . Tracer® 

10 Starting from 18.7 g of the commercially available 
product Tracer®, 8.92 g. of spinosyns A and D were 
obtained in a ratio of 82:18 after f reeze-drying and 
column chromatography on silica gel. 

15 The amino sugar forosamine was hydrolysed with 2.7 N 
sulphuric acid in ethanol under reflux. In the process, 
the. majority of the spinosyn A/D 17-pseudoaglycone 
precipitated. Besides further 17-pseudoaglycone, small 
to medium amounts of spinosyn aglycone were found in 

20 the filtrate, depending on the reaction time. 

A complete hydrolysis to the aglycone was achieved 
under somewhat, more drastic conditions (7.2 N sulphuric 
acid in methanol under reflux) . The aglycone fraction 

25 contained exclusively spinosyn A aglycone. This 
correlates very well with the literature (Creemer et 
al., 1998) in which complete decomposition of the 
spinosyn D pseudoag.lycone under corresponding reaction 
conditions is described. According to the authors, the 

30 reason may be that the 5,6 double bond in spinosyn D is 
protonated more readily, with the formation of a 
tertiary carbocation followed by rearrangements. 



Thus it was possible, starting 
commercially available Tracer®, to 
spinosyn A aglycone. 



from 18.7 g 
prepare 3.0 g 



of 
of 
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Tracer (Dow AgroScience) 
18.7 g 



Freeze-drying 



grey solid 
10.0 g 



1" 



chromatography 




MeO Me 



Spinosyn A/D (A. R = H. 82 %; D. R = CH,, 18 %) 
8.92 g 



J 2.7NH 2 SO«/ElOH 
I 3 h reflux 



69% 



16% 



OH 



MeO Me 



OH 





Spinosyn A/D 17-pseudoaglycone 
(A. R - H, 82 %; D, R = CH 3 . 18 %) 



Spinosyn A aglycone 



64% 



J 



7.2NH 2 S0 4 /MeOH 
3 h reflux 



Production of spinosyn A/D from Tracer® 



10 



Freeze-drying of 18.7 g of Tracer® gave 10.0 g of a 
grey solid. After column chromatography of this solid 
on 800 cm 3 of silica gel (eluent: dichloromethane/ 
methanol 95:5), 8.92 g of pure spinosyn A/D (82% A, 18% 
D) were obtained. 

- DC: jR f (Si0 2/ dichloromethane /methanol 9:1) = 0.46. 

- 1 H-NIXIR : CDCI3 , 5 - 6.77 (s, 13-H); 5.88 (d, 5-H of 
spinosyn A); 5.80 (m, 6-H of spinosyn A); 5.49 (m, 5-H 
of spinosyn D) ; 4.87 (d, l'-H); 4.67 (m, 21-H); 4.43 
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(d,. 1"-H) ; 4.31 (m, 9-H) and others - LC/MS: 
electrospray >, positive; peak at RT 44.0 min: m/z = 733 
(100%) [M+H] + (spinosyn A); peak at 44.7 min: m/z = 747 
(100%) [M+H] + (spinosyn D) . 
5 . • . 

Preparation of spinosyn A/D 17-pseudoaglycone: 

8.65 g (11.81 mmol) of spinosyn A/D were dissolved in 
61 ml of ethanol and mixed with 104 ml water and 208 ml 

10 of 4 N H 2 S0 4 . After heating under reflux for 3 h, the 
precipitated solid (A) was removed by filtration and 
worked, up separately from filtrate (B) . The solid (A) 
was washed, with 1 N H 2 S0 4 , taken up in 140 ml of 
dichloromethane, washed successively with saturated 

15 NaHC03 solution and saturated NaCl solution, dried over 
Na 2 S0 4 and concentrated in vacuo. Recrys tallization from 
ethanol gave 3.03 g of spinosyn A/D 17-pseudoaglycone 
and mother liquor (C) . The filtrate (B) was extracted 
several times with dichloromethane. The extracts were 

20 washed successively with saturated NaHC0 3 solution and 
saturated NaCl solution, dried over Na 2 S0 4 and 
concentrated in vacuo. The residue was combined with 
the mother liquor (C) concentrated in vacuo and was 
fractionated by column chromatography on 650 cm 3 silica 

25 gel (eluent: cyclohexane/ethyl acetate 1:1, then 100% 
ethyl acetate). Besides another 1.76 g of spinosyn A/D 
17-pseudoaglycone, 0.78 g (16%) of spinosyn A aglycone 
were obtained. The total yield of spinosyn A/D 
17-pseudoaglycone was 4.79 g (69%). - a) Spinosyn A/D 

30 17-pseudoaglycone (82% A, 18% D) ; DC: R t (Si0 2 , ethyl 
acetate) - 0.48. - X H-NMR: CDC1 3 , 5 = 6.78 (s, 13-H) ; 
5.88 (d, 5-H of spinosyn A); 5.80 (m, 6-H of spinosyn 
A); 5.49 (m, 5-H of spinosyn D) ; 4.86 (d, l'-H); 4.70 
(m, 21-H); 4.32 (m, 9-H) and others. - LC/MS: 

35 electrospray positive; peak at RT 40.7 min: m/z = 609 
(100%) [M+NH 4 ] + , m/z = 641 (10%) [M+NH 4 +CH 3 +OH ] + 
(spinosyn A pseudoaglycone) ; peak at RT 41.4 min: m/z = 
623 (100%) [M-fNH 4 ]\ m/z = 655 (8%) [M+NH 4 CH 3 OH ] + 
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(spinosyn D pseudoaglycone) . - b) Spinosyn A aglycone: 
DC: R t (Si0 2 , ethyl acetate) = 0.29. - r H-NMR: CDC1 3 , 8 = 
6.80 (s, 13-H); 5.89 (d, 5-H) ; 5.80 (m, 6-H) ; 4.70 (m, 
21-H); 4.44 (m r 9-H) and others. - LC/MS: electrospray 
5 positive; peat at RT 36.8 min: m/z = 420 (100%) 
[M+NH 4 ] + , m/z = 452 (10%) [M+NH 4 +CH 3 OH] + . 

Preparation of spinosyn A/D aglycone 

10 4.30 g (7.29 mmol) of spinosyn A/D pseudoaglycone were 
dissolved in 190 ml of methanol and mixed with 285 ml 
of 7.2 N H 2 S0 4 . After heating under reflux for 3 h, the 
cooled reaction mixture was carefully added to 1 700 ml 
of saturated NaHC0 3 solution. The mixture was extracted 

15 with diethyl ether, washed successively with saturated. 
NaHC0 3 solution and saturated NaCl solution, dried over 
Na 2 S0 4 and concentrated in vacuo. After column 
chromatography of this solid on 650 cm 3 of silica gel 
(eluent: cyclohexane/ethyl acetate 1:2, then 100% ethyl 

20 acetate), 1.88 g (64%) of spinosyn A aglycone were 
obtained. - DC: R t (Si0 2 , ethyl acetate) - 0.29. 

- X H-NMR: CDC1 3 , 5 = 6.80 (s, 13-H); 5.89 (d, 5-H) ; 5.80 
(m, 6-H) ; 4.70 (m, 21-H) ; 4.44 (m, 9-H); and others. 

- LC/MS: electrospray positive; peak at RT 36.6 min: 
25 m/z = 420 (100%) [M+NH 4 ] + , m/z = 452 (14%) [M+NH 4 +CH 3 OH ] + 

(spinosyn A aglycone) . 

Forosaminylation of the spinosyn aglycone and 



attachment 


of 


a trimethylrhamnose sugar to 


the spinosyn 


30 aglycone 


by 


biotransformation with a 


recombinant 


Streptomyc 


es 


strain expressing heterologou 


sly spinosyn 



sugar biosynthesis genes 

20 ml of R5A medium (Illing et al . , 1989) with 5 pg/ml 
35 apramycin were inoculated with mycelium of the 
recombinant strain S. albus (165-1) or S. albus (165-8) 
and incubated aerobically at 28°C for 24 h. To this 
culture, 50 pg/ml of the prepared spinosyn aglycone 
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(100 \il of a 1% strength stock solution in methanol; 
for preparation see section "Preparation of spinosyn 
aglycone and 17-pseudoaglycone from Tracer®", 
"Production of spinosyn A/D from Tracer®" and 
5 "Preparation of spinosyn A/D aglycone") were added and 
the mixture was incubated aerobically at 28 °C for 
approx . 120 h . As a control, S. albus (pEBZ340; vector 
pOJ4 4 6 with an approx. 1.8 kb, spinosyn-PKS carrying 
DNA fragment from cosmid 16-1-8) was cultured in the 
10 same way and mixed with spinosyn aglycone. After 
incubation, the cultures were centrifuged to remove 
cellular mycelium, and the supernatant (20 ml) was 
mixed with 25 ml of methanol. 

15 35 ml of the culture supernatant mixed with methanol 
were in each case lyophilized, taken up in 15 ml of 
water and extracted twice with in each case 10 ml of 
ethyl acetate. The combined organic phases were 
evaporated to dryness and taken up in 350 jil of 

20 methanol. An aliquot of these extracts was analysed by 
means of LC/MS with positive ionization 

electrospraying . 

The supernatant of the S. albus (165-1) culture 
25 contained a compound having the molecular weight of a 
f orosaminylated spinosyn A aglycone and also 
spinosyn A. 

Peak 1: RT = 41.0 min : m/z = 544 (100%) [M+H] + , m/z = 
576 (16%) ■ [M+H+CH3OH] + ( f orosaminylated spinosyn A 
30 aglycone); LC/MS/MS: m/z = 142 (38%) (forosamine 
fragment ) . 
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Forosaminylated spinosyn A aglycone 
Molecular weight = 543 
Molecular formula = C32H 49 N0 6 

Peak 2: RT = 44.2 min: m/z = 733 (100%) [M+H] + 
(spinosyn A); LC/MS/MS: m/z = 142 (21%) (forosamine 
fragment) . 



10 



15 



The S. albus (165-8) culture supernatant contained 
compound having the molecular weight of 
forosaminylated spinosyn A aglycone. 

Peak 1: RT = 40.9 min: m/z = 544 (100%) [M+H] + , m/z 
576 (16%) [M+H+CH 3 OH] + (forosaminylated spinosyn 



a 
a 



aglycone) ; 
fragment ) . 



LC/MS/MS: m/z = 142 (39%) (forosamine 



20 



25 



30 



The S. albus (pEBZ340) culture supernatant contained no 
compounds of MW 543 and no spinosyn A. 

Forosaminylation of spinosyn 17 -pseudoaglycone by 
biotransformation with a recombinant Streptomyces 
strain expressing heterologously spinosyn sugar 
biosynthesis genes 

20 ml of R5A medium (Illing et al . , 1989) with 5 yig/ml 
apramycin were inoculated with mycelium of the 
recombinant strain S. albus (165-1) or S. albus (165-8) 
and incubated aerobically at 28°C for 24 h. To this 
culture, 50 yg/ml of the prepared spinosyn 17-pseudo- 
aglycone (100 yl of a 1% strength stock solution in 
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methanol; for preparation see section "Preparation of 
. spinosyn aglycone and 17-pseudoaglycone from Tracer®", 
"Production of spinosyn A/D from Tracer®" and 
"Preparation of spinosyn 17-pseudoaglycone" ) was added 
5 and the mixture was incubated aerobically at 28°C for 
approx. 120 h. After incubation, the cultures were 
centrifuged to remove . cellular mycelium, and the 
supernatant (20 ml) was mixed with 25 ml of methanol. 

10 35 ml of the culture supernatant mixed with methanol 
were in each case lyophilized, taken up in 15 ml of 
water and extracted twice with in each case 10 ml of 
ethyl acetate. The combined organic phases were 
evaporated to dryness and taken up in 350 \il of 

15 methanol. An aliquot of these extracts was analysed by 
means of LC/MS with positive ionization 

electrospraying . 

The S. albus (165-1) culture 
20 traces of spinosyns A and. D. 

Peak 1: RT = 44.2 min: m/z 
(spinosyn A); LC/MS/MS: m/z = 
fragment) . 

Peak 2: RT . = 4 4.7 min: m/z 
25 (spinosyn D) ; LC/MS/MS: m/z = 
fragment) . 

The S. albus (165-8) culture supernatant contained 
traces of spinosyns A and D. 
30 Peak 1: RT = 44.1 min: m/z = 733 (100%) [M+H] + 
( spinosyn A) . 

Peak 2: RT = 44.7 min: m/z z = 747 (100%) [M+H] + 
(spinosyn D) . 

3 5 Deposition of microorganisms 

The following microorganisms and plasmids have been 
deposited with the Deutsche Sammlung von 



supernatant contained 

= 733 (100%) [M+H] + 
142 (8%) (forosamine 

= 747 (100%) [M+H] + 
142 (37%) (forosamine 
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Mikroorganismen und Zellkulturen GmbH (DSMZ) , 
Mascheroder Weg lb, D-38124 Brunswick, in agreement 
with the requirements of the Budapest Treaty. 



Microorganism and plasmid 

coli XLl-Blue MRF' with cosmid 
16-1-8 

E. coli XLl-Blue MRF' with cosmid 
16-2-2 

E. coli XLl-Blue MRF' with cosmid 
16-59-1 

E. coli XLl-Blue MRF' with cosmid 
16-59-6 

E. coli XLl-Blue MRF' with cosmid 
16-59-8 

E. coli XLl-Blue MRF' with cosmid 
165-1 • 

E. coli XLl-Blue MRF' with cosmid 
165-8 

E. coli DH10B with the BAC shuttle 
clone P8/G11 

E. coli DH10B with the BAC shuttle 
clone P11/B10 

E. coli DH10B with the BAC shuttle 
clone P11/G6 



Deposition Date 
number 

DSM 12961 1999-08-02 

DSM 12962 1999-08-02 

DSM 12963 1999-08-02 

DSM 12964 1999-08-02 

DSM 12965 1999-08-02 

DSM 13005 1999-08-18 

DSM 13007 1999-08-18 

DSM 13012 1999-08-18 

DSM 13011 1999-08-18 

DSM 13010 1999-08-18 
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